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(2) INFORMATION FOR SEQ ID NO:l; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



( D ) TOPOLOGY ; 1 irte ar 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO:l 
GAGAGATGGG GGAGGCTAAC TGAG 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic), 
(xi) SEQUENCE DESCRIPTION: SEQ NO: 2-: 
GATCCTCAGT TAGCCTCCCC CATCTCTC 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY = linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TATATCTCCA GATGAGGTAC ATGATTTTAG GCTTG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE.- nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
TATATATCGA TTCAAGGCAT TTTCTTTTCA TCAATAAAAC 

(2) INFORMATION FOR SEQ ID NO:S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE ; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
TCGAGGATCC GCCCGGGCGG CCGCATCGAT GTCGACG 



(2) INFORMATION FOR SEQ ID NO:S: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGCGTCGACA TCGATGCGGC CGCCCGGGCG GATCC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY i linear 

(ii) molecule type i dna (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOr7: 

AGTGAATTCG AGCTCGGTAC CCGGGGATCC TCTAGAGTCG ACCTGCAGGC 

GGCGTACTCA TGGTCAT 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: a amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single " 
CD) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ala Arg Glu Met Gly Glu Ala Asn 

1 5 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) strand edness ^ single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Dna (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCCGAGAGAT GGGGGAGGCT aactgag 

(2) INFORMATION FOR SEQ ID NO; 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) .TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO r 10: 
GGGCTCTCTA CCCCCTCCGA ttgacaccta g 



(2) INFORMATION FOR SEQ ID NO: 11 = 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(*;i) SEQUENCE DESCRIPTION r SEQ ID NO: 11: 
Thr lie Met Thr Met 
1 5 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO* 12: 
CCCTGTGCCT TATTTGAACT AACC 

(2) INFORMATION. FOR SEQ . ID NO: 13: " 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:13: 
CCCACCACAA CCACATATCC CTCC 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:14: 
CCAGTCCTCC GATTGACTG 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 8 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear ■ 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 



GCGCCAGTCC 


TCCGATTGAC 


TGAGTCGCCC 


GGGTACCCGT 


GTATCCAATA 


AACCCTCTTG 




10 


CAGTTG CATC 


CGACTTGTGG 


TCTCGCTGTT 


CCTTGGGLAGG 


GTCTCCTCTG 


AGTGATTGAC 


i: 


0 


TACCCGTCAG 


CGGGGGTCTT 


TCATTTGGGG 


GCTCGTCCGG 


GATCGGGAGA 


CCCCTGCCCA 


a 


0 


GGGACCACCG ACCCACCACC 


GGGAGGTAAG 


CTGGCCAGCA ACTTATCTGT 


GTCTGTCCGA 


2< 


□ 


TTGTCTAGTG 


TCTATGACTG 


AUTTTATGCG 


CCTGCGTCGG 


TACTAGTTAG 


CTAACTAGCT 


3t 


0 


CTGTATCTGG 


CGGACCCGTG 


GTGGAACTGA 


CGAGTTCGGA 


ACACCCGGCC 


GCAACCCTGG 


3€ 


0 


GAGACGTCCC 


AGGGACTTCG 


GGGGCCGTTT 


TTGTGGCCCG 


ACCTGAGTCC 


AAAAATCCCG 


4: 


0 



ATCGTTTTGG ACTCTTTGGT GCACCCCCCT TAGAGGAGGG ATATGTGGTT CTGGTAGGAG 4^0 
ACGAGAACCT AAAACAGTTC CCGCCTCCGT CTGAATTTTT GCTTTCGGTT TGGGACCGAA 

GCCGCGCCGC GCGTCTTGTC TGCTGCAGCA TCGTTCTGTG TTGTCTCTGT CTGACTGTGT 6<}q 

TTCTGTATTT GTCTGAGAAT ATGGGCCAGA CTGTTACCAC TCCCTTAAGT TTGACCTTAG 6t I 

GTCACTGGAA AGATGTCGAG CGGATCGCTC ACAACCAGTC GGTAGATGTC AAGAAGAGAC 7: I 

GTTGGGTTAC CTTCTGCTCT GCAGAATGGC CAACCTTTAA CGTCGGATGG CCGCGAGACG 7f 0 

GCACCTTTAA CCGAGACCTC ATCACCCAGG TTAAGATCAA GGTCTTTTCA CCTGGCCCGC Q4 ( 

ATGGACACCC AGACCAGGTC CCCTACATCG TGACCTGGGA AGCCTTGGCT TTTGACCCCC 9C 0 

CTCCCTGGGT CAAGCCCTTT GTACAGCCTA AGCCTCCGCC TC CTCTTCCT CCATCCGCCC 9€ Q 

CGTCTCTCCC CCTTGAACCT CCTCGTTCGA CCCCGCCTCG ATCCTCCCTT TATCCAGCCC 102 0 

TCACTCCTTC TCTAGGCGCC AAACCTAAAC CTCAAGTTCT TTCTGACAGT GGGGGGCCGC 108 0 

TCATCGACCT ACTTACAGAA GACCCCCCGC CTTATAGGGA CCCAAGACCA CCCCCTTCCG 114 0 

ACAGGGAGGG AAATGGTGGA GAAGCGACGC CTGCGGGAGA GGCACCGGAC CCCTCCCCAA 120 0 

TGGCATCTGG CCTACGTGGG AGACGGGAGC CCCCTGTGGC CGACTCCACT ACCTCGCAGG 126 0 

CATTCCCCCT CCGCGCAGGA GGAAACGGAC AGCTTCAATA CTGGCCGTTC TCCTCTTCTG 132 0 

AC CTTTACAA CTGGAAAAAT AATAACCCTT CTTTTTCTGA AGATCCAGGT AAACTGACAG 138 3 

CTCTGATCGA GTCTGTTCTC ATCACCCATC AGCCCACCTG GGACGACTGT CAGGAGCTGT 144 D 

TGGGGACTCT GCTGACCGGA GAAGAAAAAC AACGGGTGCT CTTAGAGGCT AGAAAGGCGG 150 3 

TGCGGGGCGA TGATGGGCGC CCCACTCAAC TGC CCAATGA AGTCGATGCC GCTTTTCCCC - .156 3 

TCGAGCGCCC AGACTGGGAT TACACCACCC AGGCAGGTAG GAACCAC CTA GTC CACTATC 162 ) 

GCCAGTTGCT CCTAGCGGGT CTCCAAAACG CGGGCAGAAG CCCCACCAAT TTGGCCAAGG IS 3 ) 

TAAAAGGAAT AACACAAGGG CCCAATGAGT CTCCCTCGGC CTTCCTAGAG AGACTTAAGG 174 ) 

AAGCCTATCG CAGGTACACT CCTTATGACC CTGAGGACCC AGGGCAAGAA ACTAATGTGT l©0O 

CTATGTCTTT CATTTGGGAG TCTGCCCCAG ACATTGGGAG AAAGTTAGAG AGGTTAGAAG 18 SU 

ATTTAAAAAA CAAGACGCTT GGAGATTTGG TTAGAGAGGC AGAAAAGATC TTTAATAAAC 1920 

GAGAAACCCC GGAAGAAAGA GAGGAACGTA TCAGGAGAGA AACAGAGGAA AAAGAAGAAC 158 0 

GGCGTAGGAC AGAGGATGAG CAGAAAGAGA AAGAAAGAGA TCGTAGGAGA CATAGAGAGA 2 04 0 



• 



TGAGCAAGCT ATTGGCCACT GTCGTTAGTG GACAGAAACA GGATAGACAG GGAGGAGAAC 23 00 

GAAGGAGGTC CCAACTCGAT CGCGACCAGT GTGCCTACTG CAAAGAAAAG GGGCACTGGG 2^€0 

CTAAAGATTG TCCCAAGAAA CCACGAGGAC CTCGGGGACC AAGACCCCAG ACCTCCCTCC 22 20 

TGACCCTAGA TGACTAGGGA GGTCAGGGTC AGGAGCCCCC CCCTGAACCC AGGATAACCC 22 80 

TCAAAGTCGG GGGGCAACCC GTCACCTTCC TGGTAGATAC TGGGGCCCAA CACTCCGTGC 2340 

TGACCGAAAA TCCTGGACCC CTAAGTGATA AGTCTGCCTG GGTCCAAGGG GCTACTGGAG 24 00 

GAAAGCGGTA TCGCTGGACC ACGGATCGCA AAGTACATCT AGCTACCGGT AAGGTCACCC 2460 

ACTCTTTCCT CCATGTACCA GACTGTCCCT ATCCTCTGTT AGGAAGAGAT TTGCTGACTA 25 20 

AACTAAAAGC CCAAATCCAC TTTGAGGGAT CAGGAGCTCA GGTTATGGGA CCAATGGGGC 25 80 

AGCCCCTGCA AGTGTTGACC CTAAATATAG AAGATGAGCA TCGGCTACAT GAGAC CTCAA 26 4 0 

AAGAGCCAGA TGTTTCTGTA GGGTCCACAT GGCTGTCTGA TTTTCCTCAG GCCTGGGCGG 27 30 

AAACCGGGGG CATGGGACTG GCAGTTCGCC AAGCTCCTCT GATCATACCT CTGAAAGCAA 27 50 

CCTCTAGCCC CGTGTCCATA AAACAATAGC CCATGTCACA AGAAGCCAGA CTGGGGATCA 28 20 

AGCCCCAGAT AGAGAGACTG TTGGACCAGG GAATACTGGT ACCCTGCGAG TCCCCCTGGA 28 3 0 

ACACGCCCCT GCTACCCGTT AAGAAACCAG GGACTAATGA TTATAGGCCT GTCCAGGATG 29 10 

TGAGAGAAGT CAACAAGCGG GTGGAAGACA TCCACCCCAC CGTGCCCAAC CCTTACAACC 3 0 )0 

TCTTGAGCGG GCTCCCACCG TCCCACCAGT GGTACACTGT GCTTGATTTA AAGGATGCCT 30 SO 

TTTTCTGCCT GAGACTCCAC CCCaCCAGTC AGCCTCTCTT CGCCTTTGAG TGGAGAGATC 3i;!0 

CAGAGATOGG AATCTCAGGA CAATTGACCT GGACCAGACT CCCACAGGGT TTCAAAAACA 3U10 

GTCCCACCCT GTTTGATGAG GCACTGCACA GAGACCTAGC AGACTTCCGG ATCCAGCACC 32- : 0 

CAGACTTGAT CCTGCTACAG TACGTGGATG ACTTACTGCT GGCCGCCACT TCTGAGCTAG 33 00 

ACTGCCAACA AGGTACTCGG GCCCTGTTAC AAACCCTAGG GAACCTCGGG TATCGGGCCT 33 ("0 

CGGCCAAGAA AGCGCAAATT TGCCAGAAAC AGGTCAAGTA TCTGGGGTAT CTTCTAAAAG 341:0 

AGGGTCAGAG ATGGCTGACT GAGGC CAGAA AAGAGACTGT GATGGGGCAG CCTACTCCGA 34 £ 0 

AGACCCCTCG ACAACTAAGG GAGTTCCTAG GGACGGCAGG GTTCTGTCGC CTCTGGATCC 3 5^ 0 

CTGGGTTTGG AGAAATGGCA GCCCCCTTGT ACCCTCTCAC CAAAACGGGG ACTCTGTTTA 36(0 

ATTGGGGCCC AGACCAACAA AAGGCGTATC AAGAAATCAA GCAAGCTCTT CTAACTGCCC 3 6* 0 



CAGCCCTGGG GTTGCCAGAT TTGACTAAGC CCTTTGAACT CTTTGTCGAC GAGAAGCAGG 3' r 20 

GCTACGCCAA AGGTGTCCTA ACGCAAAAAC TGGGACCTTG GCGTCGGCCG GTGGCCTACC 3' 'SO 

TGTCCAAAAA GCTAGACCCA GTAGCAGCTG GGTGGCCCCC TTGCCTACGG ATGGTAGCAG 3* 4 0 

CCATTGCCGT ACTGACAAAG GATGCAGGCA AGCTAACCAT GGGACAGCCA CTAGTCATTC 3^0 0 

TGGCCCCCCA TGCAGTAGAG GCACTAGTCA AACAACCCCC CGACCGCTGG CTTTGCAACG 3S60 

CCCGGATGAC TCACTATCAG GCCTTGCTTT TGGACACGGA CCGGGTCCAG TTCGGACCGG 4(20 

TGGTAGCCCT GAACCCGGCT ACGCTGCTCC CACTGCCTGA GGAAGGGCTG CAACACAACT 4C 30 

GCCTTGATAT CCTGGCCGAA GCCCACGGAA CCCGACCCGA CCTAACGGAC CAGCCGCTCC 4 3 40 

CAGACGCCGA CCACACCTGG TACACGGATG GAAGCAGTCT CTTACAAGAG GGACAGCGTA 4; 00 

AGGCGGGAGC TGCGGTGACC ACCGAGACCG AGGTAATCTG GGCTAAAGCC CTGCCAGCCG 45 60 

GGACATCCGC TCAGCGGGCT GAACTGATAG CACTCACCCA GGCCCTAAAG ATGGCAGAAG 4j20 

GTAAGAAGCT AAATGTTTAT ACTGATAGCC GTTATGCTTT TGCTACTGCC CATATCCATG 43 SO 

GAGAAATATA CAGAAGGCGT GGGTTGCTGA CATCAGAAGG CAAAGAGATC AAAAATAAAG 44 40 

ACGAGATCTT GGCCCTACTA AAAGCCCTCT TTCTGCCCAA AAGACTTAGC ATAATCCATT 45 00 

GTCGAGGACA TCAAAAGGGA CACAGCGCCG AGGCTAGAGG CAACCGGATG GCTGAGCAAG 4560 

CGGCCCGAAA GGCAGCCATC ACAGAGACTC GAGACACCTC TACCCTCCTC ATAGAAAATT 45 20 

CATCACCCTA CACCTCAGAA CATTTTCATT ACACAGTGAC TGATATAAAG GACCTAACCA 4680 

AGTTGGGGGC CATTTATGAT AAAACAAAGA AGTATTGGGT CTACCAAGGA AAACCTGTGA 4740 

TGCCTGACCA GTTTAGTTTT GAATTATTAG ACTTTCTTCA TCAGCTGACT CACCTCAGCT 48 DO 

TCTCAAAAAT GAAGGCTCTC CTAGAGAGAA GCCACAGTCC CTACTACATG CTGAACCGGG 48 50 

ATCGAACACT CAAAAATATC ACTGAGACCT GCAAAGCTTG TGCACAAGTC AACGCCAGCA 49 20 

AGTCTGCCGT TAAACAGGGA ACTAGGGTCC GCGGGCATCG GCCCGGCACT CATTGGGAGA 49 30 

TCGATTTCAC CGAGATAAAG CCCGGATTGT ATGGCTATAA ATATCTTCTA GTTTTTATAG 50 10 

AT AC CTTTTC TGGCTGGATA GAAGCCTTCC CAACCAAGAA AGAAACCGCC AAGGTCGTAA SI 50 

CCAAGAAGCT ACTAGAGGAG ATCTTCCCCA GGTTCGGCAT GCCTCAGGTA TTGGGAACTG 51 

ACAATGGGCC TGCCTTCGTC TCCAAGGTGA GTCAGACAGT GGCCGATCTG TTGGGGATTG B^ko 

ATTGGAAATT ACATTGTGGA TACAGACCCC AAAGCTCAGG CCAGGTAGAA AGAATGAATA 52 \0 



GAACCATCAA GGAGACTTTA ACTAAATTAA CGCTTGCAAC TGGCTCTAGA GACTGGGTGC 5:40 
TCCTACTCCC CTTAGCCCTG TACCGAGCCC GCAACACGCC GGGCCCCCAT GGCCTCACCC 54 qq 
CATATGAGAT CTTATATGGG GCACCCCCGC CCCTTGTAAA CTTCCCTGAC CCTGACATGA s4S0 
CAAGAGTTAC TAAGAGGCCC TCTCTCCAAG CTCACTTACA GGCTCTCTAC TTAGTCCAGC 5^20 
ACGAAGTCTG GAGACCTCTG GCGGCAGCCT ACCAAGAACA ACTGGACCGA CCGGTGGTAC 5^80 
CTCACCCTTA CCGAGTCGGC GACACAGTGT GGGTCCGCCG ACACCAGACT AAGAACCTAG 5^40 
AACCTCGCTG GAAAGGACCT TACACAGTCC TGCTGACGAC CCCCACCGCC CTCAAAGTAG 57 00 
ACGGGATCGC AGCTTGGATA CACGCCGCCC ACGTGAAGGC TGCCGACCCC GGGGGTGGAC 57 
CATCCTCTAG ACTGACATGG CGCGTTCAAC GCTCTCAAAA CCCCTTAAAA ATAAGGTTAA 58 20 
CCCGCGAGGC CCCCTAATCC CCTTAATTCT TCTGATGCTC AGAGGGGTCA GTACTGCTTC 58 30 
GCCCGGCTCC AGTCCTCATC AAGTCTATAA TATCACCTGG GAGGTAACCA ATGGAGATCG 59 4 0 
GGAGACGGTA TGGGCAACTT CTGGCAACCA CCCTCTGTGG ACCTGGTGGC CTGACCTTAG 60fcf0 
CCCAGATTTA TGTATGTTAG CCCACCATGG AC CATCTTAT TGGGGGCTAG AATATCAATC SO 

CC CTTTTTCT tctcccccgg ggcccccttg ttgctcaggg GGCAGCAGCC CAGGCTGTTC 61 

CAGAGACTGC GAAGAACCTT TAACCTCCCT CACCCCTCGG TGCAACACTG CCTGGAACAG 61 

ACTCAAGCTA GACCAGACAA CTQATAAATC AAATGAGGGA TTTTATGTTT GCCCCGGGCC 62 

CCACCGCCCC CGAGAATCCA AGTCATGTGG GGGTCCAGAC TCGTTCTACT GTGCCTATTG 63 

GGGCTGTGAG ACAACCGGTA GAGCTTACTG GAAGCCCTCC TCATCATGGG ATTTCATCAC 63 

AGTAAACAAC ■ AATCTCACCT CTGACCAGGC TGTCCAGGTA TGCAAAGATA ATAAGTGGTG €4 

CAACCCCTTA GTTATTCGGT TTACAGACGC CGGGAGACGG GTTACTTCCT GGACCACAGG 64 41 

ACATTACTGG GGGTTACGTT TGTATGTCTC CGGACAAGAT CCAGGGCTTA CATTTGGGAT 65 

CCGACTCAGA TAG CAAAATC TAGGACCCCG CGTC CCAATA GGGCCAAACC CCGTTCTGGC 66 ( 

AGACCAACAG CCACTCTCCA AGCCCAAACC TGTTAAGTCG CCTTCAGTCA CCAAACCACC SSi 

CAGTGGGACT CCTCTCTCCC CTACCCAACT TCCACCGGCG GGAACGGAAA ATAGGCTGCT 6720 

AAACTTAGTA GACGGAGCCT ACCAAGCCCT CAACCTCACC AGTCCTGACA AAACCCAAGA 67^0 

GTGCTGGTTG TGTCTAGTAG CGGGACCCCC CTACTACGAA GGGGTTGCCG TCCTGGGTAC 68< 0 

CTACTCCAAC CATACCTCTG CTCCAGCCAA CTGCTCCGTG GCCTCCCAAC ACAAGTTGAC 69C 0 



I 



CCTGTCCGAA GTGACCGGAC AGGGACTCTG CATAGGAGCA GTTCCCAAAA CACATCAGGC 

CCTATGTAAT ACCACCCAGA CAAGCAGTCG AGGGTCCTAT TATCTAGTTG CCCCTACAGG 

TACCATGTGG GCTTGTAGTA CCGGGCTTAC TCCATGCATC TCCACCACCA TACTGAACCT 

TACCACTGAT TATTGTGTTC TTGTCGAACT CTGGCCAAGA GTCACCTATC ATTCCCCGAG 

CTATGTTTAC GGCCTGTTTG AGAGATCCAA CCGACACAAA AGAGAACCGG TGTCGTTAAC 

CCTGGCCCTA TTATTGGGTG GACTAACCAT GGGGGGAATT GCCGCTGGAA TAGGAACAGG 

GACTACTGCT CTAATGGCCA CTCAGCAATT CCAGCAGCTC CAAGCCGCAG TACAGGATGA 

TCTCAGGGAG GTTGAAAAAT CAATCTCTAA CCTAGAAAAG TCTCTCACTT CCGTGTCTGA 

AGTTGTCCTA CAGAATCGAA GGGGCCTAGA CTTGTTATTT CTAAAAGAAG GAGGGCTGTG 

TGCTGCTCTA AAAGAAGAAT GTTGCTTCTA TGCGGACCAC ACAGGACTAG TGAGAGACAG 

CATGGCCAAA TTGAGAGAGA GGCTTAATCA GAGACAGAAA CTGTTTGAGT CAACTCAAGG 

ATGGTTTGAG GGACTGTTTA ACAGATCCCC TTGGTTTACC ACCTTGATAT CTACCATTAT 

GGGACCCCTC ATTGTACTCC TAATGATTTT GCTCTTCGGA ccctgcattc TTAATCGATT 

AGTCCAATTT GTTAAAGACA GGATATCAGT GGTCCAGGCT CTAGTTTTGA CTCAACAATA 

TCACCAGCTG AAGCCTATAG AGTACGAGCC ATAGATAAAA TAAAAGATTT TATTTAGTCT 

CCAGAAAAAG GGGGGAATGA AAGACCCCAC CTGTAGGTTT GGCAAGCTAG CTTAAGTAAC 

GCCATTTTGC AAGGCATGGA AAAATACATA ACTGAGAATA GAGAAGTTCA GATCAAGGTC 

AGGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 

CCGCGGCTCA GGGCCAAGAA GAGATGGAAC AGCTGAATAT GGGCCAAACA GGATATCTGT" ■ 

GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC AAGAACAGAT GGTCCCCAGA TGCGGTCCAG 

CCCTCAGCAG TTTCTAGAGA AC CATCAGAT GTTTCCAGGG TGCCCCAAGG ACCTGAAATG 

ACCCTGTGCC TTATTTGAAC TAACCAATCA GTTCGCTTCT CGCTTCTGTT CGCGCGCTTC 

TGCTCCCCGA GCTCAATAAA AGAGCCCACA ACCCCTCACT CGGGGCGCCA GTCCTCCGAT 

TGACTGAGTC GCCCGGGTAC CCGTGTATCC AATAAACCCT CTTGCAGTTG CA 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH = 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



(D) TOPOLOGY = linear 
(ii) MOLECULE TYPE r DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GGGAGTGGTA ACAGTCTGGC CTTAATTCTC AG 

(2) INFORMATION FOR SEQ ID NO: 17; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs p 

(B) TYPE ; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)- 
(xi) SEQUENCE DESCRIPTION: SEQ I-D-NOslV 
CGGTCGACCT CGAGAATTAA TTC 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO=18: 
CTGGGAGACG TCCCAGGGAC TTC 



(2) INFORMATION FOR SEQ ID NO: 19: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGCCAGACTG TTACCACTCC CTGAAGTTTG AC 

(2) INFORMATION FOR SEQ ID NO = 20 = 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 ; 
CATCGATAAA ATAAAAGATT TTATTTAGTC 



(2) INFORMATION FOR SEQ ID NO:2l: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 2 base pairs 

(B) TYPEr nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21; 
CAAATGAAAG ACCCCCGCTG AC 



(2) INFORMATION FOR SEQ ID WO; 22; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic). 
Cxi) SEQUENCE DESCRIPTION; SEQ ID NO:22: 
GAAGCTTCTC CCAGAACCCA CCAGTCTTGA AACGCCATC 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(8) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY.- linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23; 
OTACCAGCTT TTGGTCTCAT CAAAG 

(2) INFORMATION FOR SEQ ID NO:24; 

fx) SEQUENCE CHARACTERISTICS 2 

(A) LENGTH: 36 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTCCTCGAGC TAAAGATATT TTAGAGAAGA ATTAAC 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE 1 nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID N0:25: 
TTCCTCTGGA CAGCTGTCTA CTTTG 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5l base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2$: 
TCGAGGGGCC CAGATCTGCG GCCGCTCGCG AGTCGACAAG CTTGGATCCA T 



(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 49 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO:27: 
CGATGGATCC AAGCTTGTCG ACTCGCGAGC GGCCGCAGAT CTGGGCCCC 

(2) INFORMATION FOR SEQ ID NO: 28: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) -STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION 2 SEQ ID NO:28 = 
CACCGTCGTC GACTTATGCT 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE 1 DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: -SEQ ID NO : 29 : 
GACCGTCGAC TCAATTCTGG GAGAAGCTTC TTGG 

(2) INFORMATION FOR SEQ ID NO = 3 0: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CACCGTCGTC GACTTATGCT 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CAACGCTCGA GAAGCAGAAT CGCAAAAGGC 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH - 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) topology: linear 



Cii) MOLECULE TYPE; DNA ( genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TCGGCTCGAG GCATCAACGG GAAATAACTC GT 



(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE = nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE-. DNA (genomic)' 
(xi) SEQUENCE DESCRIPTION: SEQ ZD'N0:33; 
CCGACTCGAG TCAGTAGAGG TCCTGTGCCT C 

(2) INFORMATION FOR SEQ ID NO =34: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCGACTCGAG CATGGGGCCC TGGGGC 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY- linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GCACTGGAAT TCGTCAGGGC G 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs- ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CGCGCCGCTC GAGTCTACAA TGGCCTTGAC CTTTGCTTTA CTGG 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 



(2) INFORMATION FOR SEQ ID NO:3fl: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE; nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)^ 
(xi) SEQUENCE DESCRIPTION: SEQ ID* NO: 38: 
CGCGCCGCTC GAGCATCCAA, TGGCCCTGTC CTTTTCTTTA CTTATGG 



(2) INFORMATION .FOR SEQ-IDNO:39: - * 
(i) SEQUENCE CHARACTERISTICS = 

(A) LENGTH : 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE- DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 3 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION! SEQ ID NO:40: 
CCGGATCCTC TACAATGGCC TTGACCTTTG CTTTACTGG 

(2) INFORMATION FOR SEQ ID NO =41: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
.(D) - TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NQ;41: 
CGCGCCGGCG GCCGCTCATT CCTTACTTCT TAAACTTTCT TGCAAG 

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42i 
CCGGATCCCA TCCAATGG CC CTGTCCTTTT CTTTACTTAT GG 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH; 46 base pairs 

(B) TYPE: nucleic acid ■ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CGCGCCGGCG GCCGCTCAAT CCTTCCTCCT TAATCTTTTT TGCAAG 



46 



(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH; 90 BO base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single — 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

GGGGGGGGGG GGOOGGGGGG GGGTGAGCAC ATCCAGTGGG TAAAGTTCCT TAAAATGCTC 60 

TGCAAAGAAA TTGGGACTTT TCATTAAATC AGAAATTTTA CTTTTTTCCC CTCCTGGGAG L20 

CTAAAGATAT TTTAGAGAAG AATTAACCTT TTGCTTCTCC AGTTGAACAT TTGTAGCAAT .BO 

AAGTCATGCA AATAGAGCTC TCCACCTGCT TCTTTCTGTG CCTTTTGCGA TTCTGCTTTA :>4 0 

GTGCCACCAG AAGATACTAC CTGGGTGCAG TGGAACTGTC ATGGGACTAT ATGCAAAGTG 100 

ATCTCGGTGA GCTGCCTGTG GACGCAAGAT TTCCTCCTAG AGTGCCAAAA TCTTTTCCAT :i60 

TCAACACCTC AGTCGTGTAC AAAAAGACTC TGTTTGTAGA ATTCACGGAT CACCTTTTCA -.2 0 

ACATCGCTAA GCCAAGGCCA CCCTGGATGG GTCTGCTAGG TCCTACCATC CAGGCTGAGG -8 0 

TTTATGATAC AGTGGTCATT ACACTTAAGA AC ATGGCTT C CCATCCTGTC AGTCTTCATG '140 

CTGTTGGTGT ATCCTACTGG AAAGCTTCTG AGGGAGCTGA ATATGATGAT CAGACCAGTC <;00 

AAAGGGAGAA AGAAGATGAT AAAGTCTTCC CTGGTGGAAG CCATACATAT GTCTGGCAGG C6Q 

TCCTGAAAGA GAATGGTCCA ATGGCCTCTG ACCCACTGTG CCTTACCTAC TCATATCTTT "2 0 

CTCATGTGGA CCTGGTAAAA GACTTGAATT CAGGCCTCAT TGGAGCCCTA CTAGTATGTA -'80 

GAGAAGGGAG TCTGGCCAAG GAAAAGACAC AGAC CTTGCA CAAATTTATA CTACTTTTTG I 4 0 

CTGTATTTGA TGAAGGGAAA AGTTGGCACT CAGAAACAAA GAACTCCTTG ATGCAGGATA £ 00 

GGGATGCTGC ATCTGCTCGG GCCTGGCCTA AAATGCACAC AGTCAATGGT TATGTAAACA I 6 0 

GGTCTCTGCC AGGTCTGATT GGATGCCACA GGAAATCAGT CTATTGGCAT GTGATTGGAA 1C2 0 

TGGGCACCAC TC CTGAAGTG CACTCAATAT TCCTCGAAGG TCACACATTT CTTGTGAGGA 1C BO 

ACCATCGCCA GGCGTCCTTG GAAATCTCGC CAATAACTTT C CTTACTG CT CAAACACTCT 13 4 0 

TGATGGACCT TGGACAGTTT CTACTGTTTT GTCATATCTC TTCCCACCAA CATGATGGCA 12 00 

TGGAAGCTTA TGTCAAAGTA GACAGCTGTC CAGAGGAACC CCAACTACGA ATGAAAAATA 15 6 0 
. ATGAAGAAGC GGAAGACTAT GATGATGATC TTACTGATTC TGAAATGGAT GTGGTCAGGT " 13 2 0 

TTGATGATGA CAACTCTCCT TCCTTTAT C C AAATTCGCTC AGTTGCCAAG AAGCATCCTA 13 8 0 

AAACTTGGGT ACATTACATT GCTGCTGAAG AGGAGGACTG GGACTATGCT CCCTTAGTCC 14 4 0 

TCGCCCCCGA TGACAGAAGT TATAAAAGTC AATATTTGAA CAATGGCCCT CAGCGGATTG 15 0 0 

GTAGGAAGTA CAAAAAAGTC CGATTTATGG CATACACAGA TGAAACCTTT AAGACTCGTG 15 6 0 

AAGCTATTCA GCATGAATCA GGAATCTTGG GACCTTTACT TTATGGGGAA GTTGGAGACA 16 2 0 

CACTGTTGAT TATATTTAAG AATCAAGCAA GCAGACCATA TAACATCTAC CCTCACGGAA 168 0 

TCACTGATGT CCGTCCTTTG TATTCAAGGA GATTACCAAA AGGTGTAAAA CATTTGAAGG 17 4 0 

ATTTTCCAAT TCTGCCAGGA GAAATATTCA AATATAAATG GACAGTGACT GTAGAAGATG 18 0 0 

GGCCAACTAA ATCAGATCCT CGGTGCCTGA CCCGCTATTA CTCTAGTTTC GTTAATATGG 18 60 

AGAGAGATCT AGCTTCAGGA CTCATTGGCC CTCTCCTCAT CTGCTACAAA GAATCTGTAG 19 20 

ATCAAAGAGG AAACCAGATA ATGTCAGACA AGAGGAATGT CATCCTGTTT TGTGTATTTG 19 9 0 

ATGAGAACCG AAGCTGGTAC CTCACAGAGA ATATACAACG CTTTCTCCCC AATCCAGCTG 20 40 

GAG TG CAG CT TGAGGATCCA GAGTTCCAAG CCTCCAACAT CATGCACAGC ATCAATGGCT 2130 

ATGTTTTTGA TAGTTTGCAG TTGTCAGTTT GTTTGCATGA GGTGGCATAC TGGTACATTC 2150 

TAAGCATTGG AGCACAGACT GACTTCCTTT CtGTCTTCTT CTCTGGATAT ACCTTCAAAC 22 20 

ACA^AATGGT CTATGAAGAC ACACTCACCC TATTC CCATT CTCAGGAGAA ACTGTCTTCA 22 3 0 



)0 

;o 
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TGTCGATGGA AAACCCAGGT CTATGGATTC TGGGGTGC C A CAACTCAGAC TTTCGGAACA 2310 

GAGGCATGAC CGCCTTACTG AAGGTTTCTA GTTGTGACAA GAACACTGGT GATTATTACG 24 30 

AGGACAGTTA TGAAGATATT TCAGCATACT TGCTGAGTAA AAACAATGCC ATTGAACCAA 24 50 

GAAGCTTCTC CCAGAATTCA AGACACCCTA GCACTAGGCA AAAGCAATTT AATGCCACCA 25 20 

CAATTCCAGA AAATGACATA GAGAAGACTG ACCCTTGGTT TGCACACAGA ACACCTATGC 25 30 

CTAAAATACA AAATGTCTCC TCTAGTGATT TGTTGATGCT CTTGCGACAG AGTCCTACTC 26^0 

CACATGGGCT ATCCTTATCT GATCTCCAAG AAGCCAAATA TGAGACTTTT TCTGATGATG 27 

CATCAGCTGG AGCAATAGAC AGTAATAACA GCCTGTCTGA AATGACACAC TTCAGGCCAC 27 

AGCTCCATCA GAGTGGGGAC ATGGTATTTA CCCCTGAGTC AGGCCTCCAA TTAAGATTAA 2sko 

ATGAGAAACT GGGGACAACT GCAGCAACAG AGTTGAAGAA ACTTGATTTC AAAGTTTCTA 28 10 

GTACATCAAA TAATCTGATT TCAACAATTC CATGAGACAA TTTGGCAGCA GGTACTGATA 29*0 

ATACAAGTTC CTTAGGACCC CCAAGTATGC CAGTTCATTA TGATAGTCAA TTAGATACCA 30 

CTCTATTTGG CAAAAAGTCA TGTCCCCTTA CTGAGTCTGG TGGACCTCTG AGCTTGAGTG ' 30 

AAGAAAATAA TGATTCAAAG TTGTTAGAAT CAGGTTTAAT GAATAGCCAA GAAAGTTCAT 3lio 

GGGGAAAAAA TGTATCGTCA ACAGAGAGTG GTAGGTTATT TAAAGGGAAA AGAGCTCATG 311 10 

GACCTGCTTT GTTGACTAAA GATAATGCCT TATTCAAAGT TAGGATCTCT TTGTTAAAGA 32< 0 

CAAACAAAAC TTCCAATAAT TCAGCAACTA ATAGAAAGAC TCACATTGAT GGCCCATCAT 33 00 

TATTAATTGA GAATAGTCCA TCAGTCTGGC AAAATATATT AGAAAGTGAC ACTGAGTTTA 33 C 

AAAAAGTGAC AC CTTTGATT CATGACAGAA TGCTTATGGA CAAAAATGCT ACAGCTTTGA 34:. 

GGCTAAATCA TATGTCAAAT AAAACTACTT CATCAAAAAA CATGGAAATG GTCCAACAGA 34* , 

AAAAAGAGGG CCCCATTCCA CCAGATGCAC AAAATCCAGA TATGTCGTTC TTTAAGATGC 35- 

TATTCTTGCC AGAATCAGCA AGGTGGATAC AAAGGACTCA TGGAAAGAAC TCTCTGAACT 36 C 

CTGGGCAAGG CCCCAGTCCA AAGCAATTAG TATCCTTAGG ACCAGAAAAA TCTGTGGAAG 3 6€ 

GTCAGAATTT CTTGTCTGAG AAAAACAAAG TGGTAGTAGG AAAGGGTGAA TTTACAAAGG 371 

ACGTAGGACT CAAAGAGATG GTTTTTCCAA GCAGCAGAAA CCTATTTCTT ACTAACTTGG 37 i 

ATAATTTACA TGAAAATAAT ACACACAATC AAGAAAAAAA AATTCAGGAA GAAATAGAAA 38 < 

AGAAGGAAAC ATTAATCCAA GAGAATGTAG TTTTGCCTCA GATACATACA GTGACTGGCA 39( 

CTAAGAATTT CATGAAGAAC CTTTTCTTAC TGAGCACTAG GCAAAATGTA GAAGGTTCAT 39£ 

ATGACGGGGC ATATGCTCCA GTACTTCAAG ATTTTAGGTC ATTAAATGAT TCAACAAATA 4 02 

GAACAAAGAA ACACACAGCT CATTTCTCAA AAAAAGGGGA GGAAGAAAAC TTGGAAGGCT 40£ 

TGGGAAATCA AACCAAGCAA ATTGTAGAGA AATATGCATG CACCACAAGG ATATCTC CTA 414 

ATACAAGCCA GCAGAATTTT GTCACGCAAC GTAGTAAGAG AGCTTTGAAA CAATTCAGAC 42C 0 

TCCCACTAGA AGAAACAGAA CTTGAAAAAA GGATAATTGT GGATGAGACC TCAACCCAGT 426 0 

GGTCCAAAAA CATGAAACAT TTGACCCCGA GGACCCTCAC ACAGATAGAC TACAATGAGA 432 0 

AGGAGAAAGG GGCCATTACT CAGTCTCCCT TATCAGATTG CCTTACGAGG AGTCATAGCA 438 0 

TCCCTCAAGC AAATAGATCT CCATTACCCA TTGCAAAGGT ATCATCATTT CCATCTATTA 444 0 

GACCTATATA TCTGACCAGG GTCCTATTCC AAGACAACTC TTCTCATCTT CCAGCAG CAT . 450 0 

CTTATAGAAA GAAAGATTCT GGGGTCCAAG AAAGCAGTCA TTTCTTACAA GGAGCCAAAA 456 0 

AAAATAACCT TTCTTTAGCC ATTCTAACCT TGGAGATGAC TGGTGATCAA AGAGAGGTTG 462 0 

GCTCCCTGGG GACAAGTGCC ACAAATTCAG TCACATACAA GAAAGTTGAG AACACTGTTC 4 68 0 

TCCCGAAACC AGACTTGCC C AAAACATCTG GCAAAGTTGA ATTGCTTCCA AAAGTTCACA 474 0 

TTTATCAGAA GGACCTATTC CCTACGGAAA CTAGCAATGG GTCTCCTGGC CATCTGGATC 4 6 00 

TCGTGGAAGG GAGCCTTCTT CAGGGAACAG AGGGAGCGAT TAAGTGGAAT GAAGCAAACA 486 0 

GACCTGGAAA AGTTCCCTTT CTGAGAGTAG CAACAGAAAG CTCTGCAAAG ACTCCCTCCA 492 0 

AGCTATTGGA TCCTCTTGCT TGGGATAACC ACTATGGTAC TCAGATACCA AAAGAAGAGT 4 98 3 

GGAAATCCCA AGAGAAGTCA CCAGAAAAAA CAGCTTTTAA GAAAAAGGAT ACCATTTTGT 504 0 

CCCTGAACGC TTGTGAAAGC AATCATGCAA TAGCAGCAAT AAATGAGGGA CAAAATAAGC 510 0 

CCGAAATAGA AGTCACCTGG GCAAAGCAAG GTAGGACTGA AAGGCTGTGC TCTCAAAACC 516 3 

CACCAGTCTT GAAACGCCAT CAACGGGAAA TAACTCGTAC TACTCTTCAG TCAGATCAAG 522 3 

AGGAAATTGA CTATGATGAT ACCATATCAG TTGAAATGAA GAAGGAAGAT TTTGACATTT 523 3 

ATGATGAGGA TGAAAATCAG AGCCCCCGGA GCTTTCAAAA GAAAACACGA CACTATTTTA 534 3 

TTGCTGCAGT GGAGAGGCTC TGGGATTATG GGATGAGTAG CTCCCCACAT GTTCTAAGAA 54 0 3 

ACAGGGCTCA GAGTGGCAGT GTCCCTCAGT TCAAGAAAGT TGTTTTCCAG GAATTTACTG 546 D 

ATGGCTCCTT TACTCAGCCC TTATACCGTG GAGAACTAAJ^ TGAACATTTG GGACTCCTGG 552 3 
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GGC CATATAT AAGAGCAGAA GTTGAAGATA ATATCATGGT AACTTTCAGA AATCAGGCCT 55 8 0 

CTCGTCGCTA TTCCTTCTAT TCTAGCCTTA TTTCTTATGA GGAAGATCAG AGGCAAGGAG 56 4 0 

cagaacctag aaaaaacttt gtcaagccta atgaaaccaa AACTTACTTT TGGAAAGTGC 57 00 

AACATCATAT GGCACCCACT AAAGATGAGT TTGACTGCAA AGCCTGGGCT TATTTCTCTG 57 60 

ATGTTGACCT GGAAAAAGAT GTGCACTCAG GCCTGATTGG AGCCCTTCTG GTCTGCCACA 58 2 Q 

CTAACACACT GAACCCTGCT CATGGGAGAC AAGTGACAGT ACAGGAATTT GGTCTG1TTT 5880 

TCACCATCTT TGATGAGACC AAAAGCTGGT ACTTCACTGA AAATATGGAA AGAAACTGCA 5940 

GGGCTCCCTG CAATATCCAG ATGGAAGATC CCACTTTTAA AGAGAATTAT CGCTTCCATG 6000 

CAATCAATGG CTACATAATG GATACACTAC CTGGCTTAGT AATGGCTCAG GATCAAAGGA 60 50 

TTCGATGGTA TCTGCTCaGC ATGGGCAGCA ATGAAAACAT CCATTCTATT CATTTCAGTG 6120 

GACATGTGTT CACTGTACGA AAAAAAGAGG AGTATAAAAT GGCACTGTAC AATCTCTATC 6180 

CAGGTGTTTT TGAGAGAGTG GAAATGTTAC CATCCAAAGC TGGAATTTGG CGGGTGGAAT 62 40 

GGCTTATTGG CGAGCATCTA CATGCTGGGA TGAGCACACT TTTTCTGGTG TACAGCAATA ' 63 30 

AGTGTCAGAC TCGCCTGGGA ATGGCTTCTG GACACATTAG AGATTTTCAG ATTACAGCTT 63 50 

CAGGACAATA TGGACAGTGG GCCCCAAAGC TGGCCAGACT TCATTATTCC GGATCAATCA 64 20 

ATGCCTGGAG CACCAAGGAG CCCTTTTCTT GGATCAAGGT GGATCTGTTG GCACCAATGA 64 30 

TTATTCACGG CATCAAGACC CAGGGTGCCC GTCAGAAGTT CTCCAGCCTC TACATCTCTC 65 10 

AGTTT AT CAT CATGTATAGT CTTGATGGGA aGaaGTGGCA GACTTATCGA GGAAATTCCA 66 30 

CTGGAACCTT AATGGTCTTC TTTGGCAATG TGGATTCATC TGGGATAAAA CACAATATTT 66 50 

TTAACCCTCC AATTATTGCT CGATACATCC GTTTGCACCC AACTCATTAT AGCATTCGCA 67 20 

GCACTCTTCG CATGGAGTTG ATGGGCTGTG ATTTAAATAG TTGCAGCATG CCATTGGGAA 67 30 

TGGAGAGTAA AGCAATATCA GATGCACAGA TTAGTGCTTC ATCCTACTTT AC CAATATGT 68 10 

TTGCCACCTG GTCTCCTTCA AAAGCTCGAC TTCACCTCCA AGGGAGGAGT AATGCCTGGA 69 50 

GACCTCAGGT GAATAATCCA AAAGAGTGGC TGCAAGTGGA CTTCCAGAAG ACAATGAAAG 6 9 10 

TCACAGGAGT AACTACTCAG GGAGTAAAAT CTCTGCTTAC CAGCATGTAT GTGAAGGAGT 70:10 

TCCTCATCTC CAGCAGTCAA GATGGC CATC AGTGGACTCT CTTTTTTCAG AATGGCAAAG 7010 

TAAAGGTTTT TCAGGGAAAT CAAGACTCCT TCACACCTGT GGTGAACTCT CTAGACCCAC 71. L0 

CGTTACTGAC TCGCTACCTT CGAATTCACC CCCAGAGTTG GGTGCACCAG ATTGCCCTGA 7200 

GGATGGAGGT TCTGGGCTGC GAGGCACAGG ACCTCTACTG AGGGTGGCCA CTGCAGCACC 72 £0 

TGCCACTGCC GTCACCTCTC CCTCCTCAGC TCCAGGGCAG TGTCCCTCCC TGGCTTGCCT 73! 0 

TCTACCTTTG TGCTAAATCC TAGCAGACAC TGCCTTGAAG CCTCCTGAAT TAACTATCAT 73 U0 

CAGTCCTGCA TTTCTTTGGT GGGGGGCCAG GAGGGTGCAT CCAATTTAAC TTAACTCTTA 74. Q 

CCTATTTTCT GCAGCTGCTC CCAGATTACT CCTTCCTTCC AATATAACTA GGCAAAAAGA 75(10 

AGTGAGGAGA AACCTGCATG AAAGCATTCT TCCCTGAAAA GTTAGGCCTC TCAGAGTCAC 75(10 

CACTTCCTCT GTTGTAGAAA AACTATGTGA TGAAACTTTG AAAAAGATAT TTATGATGTT 76: iO 

AACATTTCAG GTTAAGCCTC ATACGTTTAA AATAAAACTC TCAGTTGTTT ATTATCCTGA 76* 0 

TCAAGCATGG AACAAAGCAT GTTTCAGGAT CAGATCAATA CAATCTTGGA GTGAAAAGGC 77< 0 

AAATCATTTG .GACAAXCTGC AAAATGGAGA GAATACAATA ACTACTACAG TAAAGTCTGT 78 C O 

TTCTGCTTCC TTACACATAG ATATAATTAT GTTATTTAGT CATTATGAGG GGCACATTCT 786 0 

TATCTCCAAA ACTAGCATTC TTAAACTGAG AATTATAGAT GGGGTTCAAG AATCCCTAAG 79*0 

TCCCCTGAAA TTATATAAGG CATTCTGTAT AAATGCAAAT GTGCATTTTT CTGACGAGTG 796 0 

TCCATAGATA TAAAGCCATT TGGTCTTAAT TCTGACCAAT AAAAAAATAA GTCAGGAGGA 8 0< 0 

TGCAATTGTT GAAAGCTTTG AAATAAAATA ACAATGTCTT CTTGAAATTT GTGATGGCCA 81C 0 

AGAAAGAAAA TGATGATGAC ATTAGGCTTC TAAAGGACAT ACATTTAATA TTTCTGTGGA 816 0 

AATATGAGGA AAATCCATGG TTATCTGAGA TAGGAGATAC AAACTTTGTA ATTCTAATAA Q22 0 

TGCAGTGAGT TTACTCTCTC CCTCTACTAA TTTCCTGCTG AAAATAACAC AACAAAAATG 8 2£ 0 

TAACAGGGGA AATTATATAC CGTGACTGAA AACTAGAGTC CTACTTACAT AGTTGAAATA 834 0 

TCAAGGAGGT CAGAAGAAAA TTGGACTGGT GAAAACAGAA AAAACACTCC AGTCTGC CAT 84 CO 

ATCACCACAC AATAGGATCC CCCTTCTTGC CCTCCACCCC CATAAGATTG TGAAGGGTTT 846 0 

ACTGCTCCTT CCATCTGCCT GACCCCTTCA CTATGACTAC ACAGAATCTC CTGATAGTAA 852 0 

AGGGGGCTGG AGGCAAGGAT AAGTTATAGA GCAGTTGGAG GAAGCATCCA AAGATTGCAA 858 0 

CCCAGGGCAA ATGGAAAACA GGAGATCCTA ATATGAAAGA AAAATGGATC CCAATCTGAG 8 64 0 

AAAAGGCAAA AGAATGGCTA CTTTTTTCTA TGCTGGAGTA TTTTCTAATA ATCCTGCTTG 8 700 

ACCCTTATCT GACCTCTTTG GAAACTATAA CATAGCTGTC ACAGTATAGT CACAATCCAC 876 0 



AAATGATGCA GGTGCAAATG GTTTATAGCC CTGTGAAGTT CTTAAAGTTT AGAGG CTAAC 
TTACAGAAAT GAATAAGTTG TTTTGTTTTA TAGCCCGGTA GAGGAGTTAA CCCCAAAGGT 
GATATGGTTT TATTTCCTGT TATGTTTAAC TTAATAATCT TATTTTGGCA TTCTTTTCCC 
ATTGACTATA TACATCTCTA TTTCTCAAAT GTTCATGGAA CTAGCTCTTT TATTTTCCTG 
CTGGTTTGTT CAGTAATGAG TTAAATAAAA CATTGACAGA TACAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA 

<2) INFORMATION FOR SEQ ID NO: 45 i 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2351 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: -linear - • " • " : * 
(ii) MOLECULE TYPE; protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cy$ Leu Leu Arg Phe 
X 5 ' 10 IS 



88 >0 

88 JO 

89 LO 

90 )0 
90 IQ 
90$0 



Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
is 40 45 

Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
50 55 60 

Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn lie 
65 70 75 80 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr He Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val Val He Thr Leu Lys Asn Met Ala Ser 
100 105 110 

His Pro Val Ser Leu His Ala val Gly - val- Ser Tyr' Trp Lys Ala Ser 
115 120 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 13S 140 



Asp Lys Val Phe Pro Gly Gly Ser 
145 150 

Lys Glu Asn Gly Pro Met Ala Ser 

165 

Tyr Leu Ser His Val Asp Leu Val 
180 



His Thr Tyr Val Trp Gin Val Leu 
155 160 

Asp Pro Leu Cys Leu Thr Tyr Ser 
170 175 

Lys Asp Leu Asn Ser Gly Leu He 
18S 190 



Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 



Gin Thr Leu His Lys Phe He Leu Leu Phe Ala Val phe Asp Glu Gly 
210 215 220 



Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 230 235 240 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 

245 * 250 255 

Val Asn Arg Ser Leu Pro Gly Leu He Gly Cys His Arg Lys Ser Val 
260 265 270 

Tyr Trp His Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser He 
275 • " 280 285 

Phe Leu Glu Gly His Thr Phe Leu val Arg Asn His Arg Gin- Ala Ser 
290 295 * 300 

Leu Glu He Ser Pro He Thr phe Leu Thr Ala Gin Thr Leu Leu Met 
305 310 315 320 

Asp Leu Gly Gin Phe Leu Leu Phe Cys His He Ser Ser His Gin His 

325 330 . 335 

Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
340 345 350 

Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 

Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 380 

Pro Ser Phe He Gin He Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 400 

Trp Val- His Tyr He. Ala. Ala Glu! Glu Glu' Asp Trp Asp Tyr Ala Pro 

405 410 415 

Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 
420 425 430 

Asn Gly Pro Gin Arg He Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 

435 440 445 

Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala lie Gin His Glu 
450 455 460 

Ser Gly lie Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 



Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr .Asn He Tyr Pro 

435 490 495 



His Gly He Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
500 SOS 510 

Gly Val Lys His Leu Lys Asp Phe Pro He Leu Pro Gly Glu lie Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 * 540 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 555 560 

Asp Leu Ala Ser Gly Leu He Gly Pro Leu Leu He Cys Tyr Lys Glu 

565 " 570 575 

Ser Val Asp Gin Arg Gly Asn Gin He Met: Ser Asp Lys Arg- Asn Val 
580 585 590 

He Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 600 605 

Asn He Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 615 620 

Pro Glu Phe Gin Ala Ser Asn He Met His Ser He Asn Gly Tyr Val 
625 630 615 640 

Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 

645 650 655 

Tyr He Leu Ser He Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 

Ser Gly Tyr Thr Phe Lys His Lys Met: Val Tyr Glu Asp Thr Leu Thr 
675 680 595 

Leu Phe Pro Phe Ser Gly Glu Thr Val 'Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp He Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
70S 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 

725 730 735 

Tyr Tyr Glu Asp Ser Tyr Glu Asp He Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala He Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg His Pro 
755 760 765 

Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Thr lie. Pro Glu Asn Asp 
770 775 780 



lie Glu Lys Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys 
785 790 795 800 

He Gin Asn Val Ser Ser Ser Asp Leu Leu Met Leu Leu Arg Gin Ser 

805 810 815 

Pro Thr Pro His Gly Leu Ser Leu ger Asp Leu Gin Glu Ala Lys Tyr 
320 825 830 

Glu Thr Phe Ser Asp Asp Pro Ser Pro Gly Ala He Asp Ser Asn Asn 
835 640 - 84S 

Ser- Leu Sex Glu Met Thr His Phe Arg Pro Gin Leu His His Ser Gly 
850 855 8SQ 

Asp Met Val Phe Thr Pro Glu Ser Gly Leu Gin Leu Arg Leu -Asn Glu 
865 870 * 875 880 

Lys Leu Gly Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys 

885 890 Q35 

val Ser Ser Thr Ser Asn Asn Leu He Ser Thr lie pro Ser Asp A$n 
900 905 . 9io. 

Leu Ala Ala Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met 
515 920 925 

Pro Val His Tyr Asp Ser Gin Leu Asp Thr Thr Leu Phe Gly Lys Lys 
930 935 940 

Ser Ser Pro Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu 
945 950 955 960 

Asn Asn Asp Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gin Glu 

965 970 375 

Ser Ser Trp Gly. Lys Asn Val Ser Ser Thr Glu Ser ' Gly Arg Leu Phe 
960 985 990 

Lys Gly Lys Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala 
995 1000 1005 

Leu Phe Lys Val Ser lie Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn 
1010 1015 1020 

Asn Ser Ala Thr Asn Arg Lys Thr His lie Asp Gly Pro Ser Leu Leu 
1025 1030 1035 1040 

lie Glu Asn Ser Pro Ser val Trp Gin Asn lie Leu Glu Ser Asp Thr 

1045 1050 1055 

Glu Phe Lys Lys Val Thr Pro Leu lie His Asp Arg Met Leu Met Asp 
1060 1065 1070 



Lys Asn Ala Thr Ala. Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr 
1075 1080 1085 



Ser Ser Lys Asn. Met Glu Met Val Gin Gin Lys Lys Glu Gly Pro lie 
1090 1Q$5 1100 

Pro Pro Asp Ala Gin Asn Pro Asp Jfet Ser Phe Phe Lys Met: Leu Phe 
1105 1110 1115 1120 

Leu Pro Glu Ser Ala Arg Trp lie Gin Arg Thr His Gly Lys Asn Ser 

1125 1130 ' 1135 

Leu Asn Ser Gly Gin Gly Pro Ser Pro Lys Gin Leu val Ser Leu Gly 
1140 ' ' 1145 1150 

Pro Glu Lys Ser Val Glu Gly Gin Asn Phe Leu Ser Glu Lys Asn Lys 
1155 1160 1165 

Val Val val Gly Lys Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu 
1170 1175 1180 

Met Val Phe Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn 
1185 1190 1195 1200 

Leu His Glu Asn Asn Thr His Asn Gin Glu Lys Lys He Gin Glu Glu 

1205 1210 1215 

lie Glu Lys Lys Glu Thr Leu lie Gin Glu Asn Val val Leu Pro Gin. 
1220 1225 1230 

lie His Thr Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu 

1235 1240 1245 

Leu Ser Thr Arg Gin Asn Val Glu Gly Ser Tyr Asp Gly Ala Tyr Ala 
1250 1255 1260 

Pro yal ' Leu' Gin Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn Arg Thr 
1265 1270 1275 1280 

Lys Lys His Thr Ala His Phe Ser Lys Lys Gly Glu Glu Glu Asn Leu 

1285 1290 1295 

Glu Gly Leu Gly Asn Gin Thr Lys Gin lie Val Glu Lys Tyr Ala Cys 
1300 1305 1310 

Thr Thr Arg He Ser Pro Asn Thr Ser Gin Gin Asn Phe Val Thr Gin 
1315 1320 1325 

Arg Ser Lys Arg Ala Leu Lys Gin Phe Arg Leu Pro Leu Glu Glu Thr . 
1330 1335 1340 



Glu Leu Glu Lys Arg He He Val Asp Asp Thr Ser .Thr Gin Trp Ser 
1345 1350 1355 1360 



Lys Asn Met Lys His Leu Thr Pro Ser Thr Leu Thr Gin lie Asp Tyr 

1365 1370 1375 

Asn Glu Lys Glu Lys Gly Ala lie Thr Gin Ser Pro Leu Ser Asp Cys 
1380 1385 1330 

Leu Thr Arg Ser His Ser lie Pro pin Ala Asn Arg Ser Pro Leu Pro 
1395 1400 1405 

He Ala Lys val Ser Ser Phe Pro Ser He Arg Pro He Tyr Leu Thr 
1410 1415 1420 ■ 

Arg Val Leu Phe Gin Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr 
1425 1430 " 143S 1440 

Arg Lys Lys Asp Ser Gly Val Gin Glu Ser Ser His Phe Leu- Gin Gly 

1445 ' 1450 1455 

Ala Lys Lys Asn Asn Leu Ser Leu Ala lie Leu Thr Leu Glu Met Thr 
1460 1465 1470 

Gly Asp Gin Arg Glu Val Gly Ser Leu Gly Thr Ser Ala 'Thr Asn Ser 
1475 1480 14Q5 

Val Thr Tyr Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu 
.1430 1495 1500 

Pro Lys Thr Ser Gly Lys Val Glu Leu Leu Pro Lys Val His lie Tyr 
1505 1510 1515 1520 

Gin Lys Asp Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His 

1525 1530 1535 

Leu Asp Leu Val Glu Gly Ser Leu Leu Gin Gly Thr Glu Gly Ala He 
1540 1545 1550 

Lys Trp Asn Glu Ala Asn Arg Pro Gly Lys ' Val Pro Phe Leu Ar.g Val ' 
1555 1560 1565 

Ala Thr Glu Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu 
1570 1575 1580 

Ala Trp Asp Asn His Tyr Gly Thr Gin He Pro Lys Glu Glu Trp Lys 
1585 1590 1595 1600 

Ser Gin Glu Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr 

1605 1610 1615 

lie Leu Ser Leu Asn Ala Cys Glu Ser Asn His Ala He Ala Ala He 
1620 1625 1630 

Asn Glu Gly Gin Asn Lys Pro Glu He Glu Val Thr Trp Ala Lys Gin 
1535 1640 1645 



Gly Arg Thr Glu Arg Leu Cys Ser Gin Asn Pro Pro Val Leu Lys Arg 
1650 1655 1660 

His Gin Arg Glu lie Thr Arg Thr Thr Leu Gin Ser Asp Gin Glu Glu 
3-665 1670 1675 1630 

lie Asp Tyr Asp Asp Thr lie Ser Val Glu Met Lys Lys Glu Asp Phe 

1665 * 1690 1695 

Asp lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys 
1700 1705 • 1710 

Lys Thr Arg His Tyr Phe Tie Ala Ala Val Glu Arg Leu Trp Asp Tyr 
1715 ' " 1720 ' 1725 

Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin- Ser Gly 
1730 1735 ' 1740 

Ser Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly 
1745 1750 1755 1760 

Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly 

1765 1770 . 1775 

Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie Met Val 
1780 1785 1790 

Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu 
1795 1800 isos 

lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg Lys Asn 
1810 1815 1820 

Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gin His 
1825 1830 1835 1840 

His Met Ala- Pro- Thr Lys Asp Glu. Phe Asp Cys Lys Ala Trp Ala Tyr 

1845 1850 1QS5 

Phe Ser Asp Val Asp Leu Glu Lys Asp val His Ser Gly Leu lie Gly 
1860 1865 1870 

Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg 
1575 1880 1885 

Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr lie Phe Asp Glu 
1890 1895 1900 

Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala 
1905 1910 1915 1920 



Pro Cys Asn lie Gin Met Glu Asp Pro Thr Phe Lys .Glu Asn Tyr Arg 

1925 1930 1935 



Phe His Ala lie Asn Gly Tyr He Men Asp Thr Leu Pro Gly Leu Val 
1940 1945 1950 



Met Ala Gin Asp Gin Arg He Arg Trp Tyr Leu Leu Ser Met Gly Ser 
1955 I960 1965 

Asn Glu Asn lie His Ser He His Phe Ser Gly His val Phe Thr Val 
1970 1975 * 1980 

Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly 
1985 1990 ' 1995 - 2000 

- ^ ■ ' * * 

Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly He Trp Arg 

2005 ' " 2010 2015 

Val Glu Cys Leu lie Gly Glu His Leu His Ala Gly Met Ser- Thr Leu 
2020 * 2025 2030 

Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met Ala Ser 
2035 2040 2045 

Gly His He Arg Asp Phe Gin He Thr Ala Ser Gly Gin Tyr Gly Gin 
2050 2055 2060 

Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He Asn Ala 
2065 2070 2075 2080 

Trp Ser Thr Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu Leu Ala 

2085 2090 2095 

Pro Met He He His Gly He Lys Thr Gin Gly Ala Arg Gin Lys Phe 
2100 2105 211G 

Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr Ser Leu Asp Gly 
2115 2120 2125 

Lys Lys Trp Gin -Thr. Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val 
2130 2135 2140' 

Phe Phe Gly Asn Val Asp Ser Ser Gly He Lys His Asn He Phe Asn 
2145 2150 2155 2160 

Pro Pro He He Ala Arg Tyr He Arg Leu His Pro Thr His Tyr Ser 

2165 2170 2175 

He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser 
2180 2185 2190 

Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp Ala Gin 
2195 2200 2205 



He Thr Ala ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro 
2210 2215 2220 



Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp Arg Pro 
2225 2230 2235 2240 

Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin Lys Thr 

2245 2250 2255 

Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu Leu Thr 
2260 2255 2270 

Ser Met Tyr Val Lys Glu Phe Leu He Ser Ser Ser Gin Asp Gly His 
2275 2280' 2285 - 

Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe Gin Gly 
2290 2295 2300 

Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu 

2305 2310 ' 2315 2320 

Leu Thr Arg Tyr Leu Arg He His Pro Gin Ser Trp Val His Gin lie 

2325 2330 2335 

Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu Tyr 
2340 2345 2350 

(2) INFORMATION FOR SEQ ID NO: 46: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4832 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

CTCGAGCTAA AGATATTTTA GAGAAGAATT AACCTTTTGC TTCTCCAGTT GAACATTTGT 
AGCAATAAGT CATGCAAATA GAGCTCTCCA CCTGCTTCTT TCTGTGCCTT TTGCGATTCT lio 
GCTTTAGTGC CACCAGAAGA TACTACCTGG GTGCAQTGGA ACTGTCATGG GACTATATGC 1* 0 

AAAGTGATCT CGGTGAGCTG CCTGTGGACG CAAGATTTCC TCCTAGAGTG CCAAAATCTT 2* 0 

TTCCATTCAA CACCTCAGTC GTGTACAAAA AGACTCTGTT TGTAGAATTC ACGGATCACC 3t 0 

TTTTCAACAT CGCTAAGCGA AGGCCACCCT GGATGGGTCT GCTAGGTCCT ACCATCCAGG ' 3i 0 
CTGAGGTTTA TGATACAGTG GTCATTACAC TTAAGAACAT GGCTTCCCAT CCTGTCAGTC 4^ 0 

TTCATGCTGT TGGTGTATCC TACTGGAAAG CTTCTGAGGG AGCTGAATAT GATGATCAGA 4£ 0 

CCAGTCAAAG GGAGAAAGAA GATGATAAAG TCTTCCCTGG TGGAAGCCAT ACATATGTCT S< 0 

GGCAGGTCCT GAAAGAGAAT GGTCCAATGG CCTCTGACCC ACTGTGCCTT ACCTACTCAT 6(0 
ATCTTTCTCA TGTGGACCTG GTAAAAGACT TGAATTCAGG CCTCATTGGA GCCCTACTAG 6£ 0 

TATGTAGAGA AGGGAGTCTG GCCAAGGAAA AGACACAGAC CTTGCACAAA TTTATACTAC 71 0 

TTTTTGCTGT ATTTGATGAA GGGAAAAGTT GGCACTCAGA AACAAAGAAC TCCTTGATGC 76 0 

AGGATAGGGA TGCTGCATCT GCTCGGGCCT GGCCTAAAAT GCACACAGTC AATGGTTATG 84 0 

TAAACAGGTC TCTGCCAGGT CTGATTGGAT GCCACAGGAA ATCAGTCTAT TGGCATGTGA 9C0 

TTGGAATGGG CACCACTCCT GAAGTGCACT CAATATTCCT CGAAGGTCAC ACATTTCTTG 9£ 0 

TGAGGAACCA TCGCCAGGCG TCCTTGGAAA TCTCGCCAAT AACTTTCCTT ACTGCTCAAA 1,02 0 

CACTCTTGAT GGACCTTGGA CAGTTTCTAC TGTTTTGTCA TATCTCTTCC CACCAACATG 108 0 

ATGGCATGGA AG CTTATGTC AAAGTAGACA GCTGTCCAGA GGAACCCGAA CTACGAATGA 114 0 

AAAA TAAT GA AGAAGCGGAA GACTATGATG ATGATCTTAC TGATTCTGAA ATGGATGTGG 12 0 0 

TCAGGTTTGA TGATGACAAC TCTCCTTCCT TTATCCAAAT TCGCTCAGTT GCCAAGAAGC 126 0 



0 



0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



ATCCTAAAAC TTGGGTACAT TACATTGCTG CTGAAGAGGA GGACTGGGAC TATGCTCCCT 13: >o 

TAGTCCTCGC CCCCGATGAC AGAAGTTATA AAAGTCAATA TTTGAACAAT GGCCCTCAGC 13; 10 

GGATTGGTAG GAAGTACAAA AAAGTCCGAT TTATGGCATA CACAGATGAA ACCTTTAAGA 14<t0 

CTCGTGAAGC TATTCAGCAT GAATCAGGAA TCTTGGGACC TTTACTTTAT GGGGAAGTTG 15UQ 

GAGACACACT GTTGATTATA TTTAAGAATC AAGCAAGCAG ACCATATAAC ATCTACCCTC 15<;0 

ACGGAATCAC TGATGTCCGT CCTTTGTATT CAAGGAGATT AC CAAAAGGT GTAAAACATT lglo 

TGAAGGATTT TCCAATTCTG CCAGGAGAAA TATTCAAATA TAAATGGACA GTGACTGTAG ie|o 

AAGATGGGCC AACTAAATCA GATCCTCGGT GCCTGACCCG CTATTACTCT AGTTTCGTTA 17 

ATATGGAGAG AGATCTAGCT TCAGGACTCA TTGGCCCTCT CCTCATCTGC TACAAAGAAT 18 $0 

CTGTAGATCA AAGAGGAAAC CAGATAATGT CAGACAAGAG GAATGTCATC CTGTTTTCTG 18 CO 

TATTTGATGA GAACCGAAGC TGGTACCTCA CAGAGAATAT ACAACGCTTT CTCCCCAATC 13:\Q 

CAGCTGGAGT GCAGCTTGAG GATCCAGAGT TCCAAGCCTC CAACATCATG CACAGCATCA 19* 0 
ATGGCTATGT TTTTGATAGT TTGCAGTTGT CAGTTTGTTT GCATGAGGTG GCATACTGGT " 2Q< 

ACATTCTAAG CATTGGAGCA CAGACTGACT TCCTTTCTGT CTTCTTCTCT GGATATACCT 21 C 

TCAAACACAA AATGGTCTAT GAAGACACAC TCACCCTATT CCCATTCTCA GGAGAAACTG 21* 

TCTTCATGTC GATGGAAAAC CCAGGTCTAT GGATTCTGGG GTGCCACAAC TCAGACTTTC 22; 

GGAACAGAGG CATGACCGCG TTACTGAAGG TTTCTAGTTG TGACAAGAAC ACTGGTGATT 22 1 

ATTACGAGGA CAGTTATGAA GATATTTCAG CATACTTGCT GAGTAAAAAC AATGCCATTG 23 i 

AACCAAGAAG CTTCTCCCAG AACCCACCAG TCTTGAAACG CCATCAACGG GAAATAACTC 24( 

GTACTACTCT TCAGTCAGAT CAAGAGGAAA TTGACTATGA TGATAC CAT A TCAGTTGAAA 24 € 

TGAAGAAGGA AGATTTTGAC ATTTATGATG AGGATGAAAA TCAGAGCCCC CGCAGCTTTC 25; 

AAAAGAAAAC ACGACACTAT TTTATTGCTG CAGTGGaGaG GCTCTGGGAT TATGGGATGA 25E 

GTAGCTCCCC ACATGTTCTA AGAAACAGGG CTCAGAGTGG CAGTGTCCCT CAGTTCAAGA 264 

AAGTTGTTTT CCAGGAATTT ACTGATGGCT CCTTTACTCA GCCCTTATAC CGTGGAGAAC 27 C 0 

TAAATGAACA TTTGGGACTC CTGGGGCCAT ATATAAGAGC AGAAGTTGAA GATAATATCA 276 0 

TGGTAACTTT CAGAAATCAG GCCTCTCG7C CCTATTCCTT CTATT CTAG C CTTATTTCTT 282 0 

ATGAGGAAGA TCAGAGGCAA GGAGCAGAAC CTAGAAAAAA CTTTGTCAAG CCTAATGAAA 288 0 

CCAAAAGTTA CTTTTGGAAA GTGCAACATC ATATGGCACG CACTAAAGAT GAGTTTGACT 294 0 

GCAAAGCCTG GGCTTATTTC TCTGATGTTG ACCTGGAAAA AGATGTGCAC TCAGGCCTGA 30 a a 

TTGGACCCCT TCTGGTCTGC CACACTAACA CACTGAACCC TGCTCATGGG AGACAAGTGA 306 0 

CAGTACAGGA ATTTGCTCTG TTTTTCACCA TCTTTGATGA GACCAAAAGC TGGTACTTCA 3120 

CTGAAAATAT GGAAAGAAAC TGCAGGGCTC CCTGGAATAT CCAGATGGAA GATCC CACTT 3180 

TTAAAGAGAA TTATCGCTTC CATGCAATCA ATGGCTACAT AATGGATACA CTACCTGGCT 3240 

TAGTAATGGC TCAGGATCAA AGGATTCGAT GGTATCTGCT CAGCATGGGC AGCAATGAAA 33 00 

ACATC CATTC TATTCATTTC AGTGGACATG TGTTCACTGT ACGAAAAAAA GAGGAGTATA 336 0 

AAATGGCACT GTACAATCTC TATCCAGGTG TTTTTGAGAC AGTGGAAATG TTACCATCCA 342 0 
AAGCTGGAAT TTGGCGGGTG GAATGCCTTA TTGGCGAGCA TCTACATGCT GGGATGAGCA . 348 0 

CACTTTTTCT GGTGTACAGC AATAAGTGTC AGACTCCCCT GGGAATGGCT TCTGGACACA 354 0 

TTAGAGATTT TCAGATTACA GCTTCAGGAC AATATGGACA GTGGGCC CCA AAGCTGGCCA 3 60 3 

GACTTCATTA TTCCGGATCA ATCAATGCCT GGAGCACCAA GGAGCCCTTT TCTTGGATCA 356 3 

AGGTGGATCT GTTGGCACCA ATGATTATTC ACGGCATCAA GACCCAGGGT GCCCGTCAGA 372 3 

AGTTCTCCAG CCTCTACATC TCTCAGTTTA TCATCATGTA TAGTCTTGAT GGGAAGAAGT 378 3 

GGCAGACTTA TCGAGGAAAT TCCACTGGAA CCTTAATGGT CTTCTTTGGC AATGTGGATT 384 3 

CATCTGGGAT AAAACACAAT ATTTTTAACC CTCCAATTAT TGCTCGATAC ATCCGTTTGC 390 ) 

ACCCAACTCA TTATAGCATT CGCAGCACTC TTCG CATGGA GTTGATGGGC TGTGATTTAA 396 ) 

ATAGTTGCAG CATGCCATTG GGAATGGAGA GTAAAGCAAT ATCAGATGCA CAGATTACTG 4 02 ) 

CTTCATCCTA CTTTACCAAT ATGTTTGCCA CCTGGTCTCC TTCAAAAGCT CGACTTCACC 4 OS ) 

TCCAAGGGAG GAGTAATGCC TGGAGACCTC AGGTGAATAA TCCAAAAGAG TGGCTGCAAG 414 ) 

TGGACTTCCA GAAGACAATG AAAGTCACAG GAGTAACTAC TCAGGGAGTA AAATCTCTGC 42 0) 

TT AC CAG CAT GTATGTGAAG GAGTTCCTCA TCTCCAGCAG TCAAGATGGC CATCAGTGGA 42 6 ) 

CTCTCTTTTT TCAGAATGGC AAAGTAAAGG TTTTTCAGGG AAATCAAGAC TCCTTCACAC 432 ) 

CTGTGGTGAA CTCTCTAGAC CCACCGTTAC TGACTCGCTA CCTTCGAATT CACCCCCAGA 43 8 ) 

GTTGGGTGCA CCAGATTGCC CTGAGGATGG AGGTTCTGGG CTGCGAGGCA CAGGACCTCT 4 4411 

ACTGAGGGTG GCCACTGCAG CACCTGCCAC TGCCGTCACC TCTCCCTCCT CAGCTCCAGG 4500 



GCAGTGTCCC TCCCTGGCTT GCCTTCTACC 
GAAGCCTCCT GAATTAACTA TCATCAGTCC 
GCATCCAATT TAACTTAACT CTTACCTATT 
TTCCAA'TATA actaggcaaa aagaagtGag 
AAAAGTTAGG CCTCTCAGAG TCACCACTTC 
TTTGAAAAAG ATATTTATGA TGTTGCGGCC 



tttgtgctaa atcctagcag acactgcctt 

TGCATTTCTT TGGTGGGGGG CCAGGAGGGT 

TTCTGCAGCT gctcccagat tactccttcc 

GAGAAACCTG CATGAAAGCA TTCTTCCCTG 
CTCTGTTGTA GAAAAACTAT GTGATGAAAC 
GC 



(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14S7 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) -TOPOLOGY: -linea-r - ■ , 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47; 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1 5 " 10 is 

Cys Phe Ser Ala Thr Arg Arg Ty^ Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
35 40 45 

Phe Pro Pro Arg Val Pro Lys Ser Phe Pro phe Asn Thr Ser Val Val 
50 55 60 

Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn He 
65 70 75 80 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr He Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val val He Thr Leu Lys Asn Men Ala Ser 
100 105 no 

His Pro Val Ser Leu' His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 135 140 

Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu 
145 150 155 iso 

Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 

165 170 175 

Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu He 
180 1B5 190 

Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala .Lys Glu Lys Thr 
195 200 205 



Gin Thr Leu His Lys Phe He Leu Leu Phe Ala Val Phe Asp Glu Gly 
210 215 220 

Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 230 235 240 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 

245 * 250 2 55 

Val Asn Arg Ser Leu Pro Gly Leu He Gly Cys His Arg Lys Ser Val 
2€Q 265 - 270 

Tyr Trp His Val He Gly Met Gly Thr Thr Pro Glu Val His Ser lie 
275 * ' 2B0 285 

Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 
290 295 ' 300 

Leu Glu He Ser Pro He Thr Phe Leu Thr Ala Gin Thr Leu Leu Mer 
305 310 315 320 

Asp Leu Gly Gin Phe Leu Leu Phe Cys His He Ser Ser His Gin His 

325 330 .335 

Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
340 345 350 

Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 

Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 38Q 

Pro Ser Phe lie Gin He Arg Ser Val Ala Lys Lys His Pro Lys Thr 
335 390 395 400 

Trp Val His Tyr He Ala Ala Glu Glu Glu Asp ' Trp Asp Tyr Ala Pro 

405 410 415 

Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 
420 425 430 

Asn Gly Pro Gin Arg He Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
435 440 445 

Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala He Gin His Glu 
450 455 460 

Ser Gly He Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 

Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr .Asn He Tyr Pro 

485 490 495 



His GXy He Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
500 505 510 

Gly Val Lys His Leu Lys Asp Phe Pro He Leu Pro Gly Glu He Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 * 540 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 - 555 - , 560 

Asp Leu Ala Ser Gly Leu He Gly Pro Leu Leu He Cys Tyr Lys Glu 

565 • ' ' 570 575 

Ser Val Asp Gin Arg Gly Asn Gin He Met Ser Asp Lys Arg- Asn Val 
580 ' 585 590 

He Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 600 60S 

Asn He Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 €15 620 

Pro Glu Phe Gin Ala Ser Asn He Met His Ser He Asn Gly Tyr Val 
625 630 635 640 

Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 

645 650 655 

Tyr He Leu Ser He Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 

Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe. Met Ser Met Glu Asn Pro 
690 " 695 700 

Gly Leu Trp He Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 

725 730 735 

Tyr Tyr Glu Asp Ser Tyr Glu Asp He Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala He Glu Pro Arg Ser Phe Ser Gin Asn Pro Pro Val Leu 
755 760 765 

Lys Arg His Gin Arg Glu He Thr Arg Thr Thr Leu .Gin Ser Asp Gin 
770 775 780 



Glu Glu He Asp Tyr Asp Asp Thr He Ser Val Glu Met Lys Lys Glu 
785 790 795 800 

Asp Phe Asp lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe 

805 810 815 

Gin Lys Lys Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp 
820 fc25 830 

Asp Tyr Gly Men Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin 
835 840 - 8-45 

Ser Gly Ser val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr 
850 855 860 

Asp Gly Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His 
865 870 * 875 880 

Leu Gly Leu Leu Gly Pro Tyr He Arg Ala Glu Val Glu Asp Asn He 

885 890 895 

Met Val Thr Phe Airg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser 
900 905 910. 

Ser Leu lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg 
915 920 925 

Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val 
930 935 940 

Gin His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp 
945 950 955 960 

Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu 

965 970 975 

He Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His 
980 '985 990 

Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr He Phe 
995 1000 1005 

Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys 
1010 1015 1020 

Arg Ala Pro Cys Asn He Gin Met Glu Asp Pro Thr Phe Lys Glu Asn 
1025 1030 1035 1040 

Tyr Arg Phe His Ala He Asn Gly Tyr He Met Asp Thr Leu Pro Gly 

1045 1050 1055 

Leu val Met Ala Gin Asp Gin Arg He Arg Trp Tyr ,Leu Leu Ser Met 
1060 1065 1070 



Gly Ser Asn Glu Asn He His Ser lie His Phe Ser Gly His Val Phe 
1075 1080 1085 

Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr 
3-090 1095 noo 

Pro Gly Val Phe Glu Thr val Glu Met Leu Pro Ser Lys Ala Gly He 
1105 1110 * 1115 1120 

Tip Arg Val Glu Cys Leu He Gly Glu His Leu His Ala Gly Mec Ser 

1125 1130 H35 

Thr Leu Phe Leu Val Tyr Ser Asn Lys Cy S Gin Thr Pro Leu Gly Met 
H40 " ■ 1145 iiso 

Ala Ser Gly His He Arg Asp Phe Gin He Thr Ala Ser Gly- Gin Tyr 
H55 1160 i l65 

Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He 
1170 1175 iiso 

Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu 
"S5 1190 ii55 1200 

Leu Ala Pro Met He He His Gly He Lys Thr Gin Gly Ala Arg Gin 

1205 1210 !215 

Lys Phe Ser Ser Leu Tyr He Ser Gin Phe tie He Mec Tyr Ser Leu 
1220 1225 1230 

Asp Gly Lys Lys Trp Gin Thr Tyr Arg Gly , Asn Ser Thr Gly Thr Leu 
1235 1240 1245 

Mec Val Phe Phe Gly Asn val Asp Ser Ser Gly He Lys His Asn He 
1250 1255 1260 

Phe Asn Pro Pro He He Ala Arg Tyr He Arg Leu His' Pro Thr His ■ 
1265 1270 1275 1280 

Tyr Ser He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu 

1285 1290 1295 

Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp 
1300 1305 1310 

Ala Gin He Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp 
1315 1320 1325 

Ser Pro Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp 
1330 1335 1340 

Arg Pro Gin Val Asn Asn Pro Lys Glu Trp Leu Gin. Val Asp Phe Gin 
1345 1350 1355 1360 



Lys Thr Met Lys Val Thr Gly val Thr Thr Gin Gly Val Lys Ser Leu 

1365 1370 1375 

Leu Thr Ser Met Tyr Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp 
1380 1385 1390 

Gly His Gin Tip Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe 
1355 1400* 1405 

Gin Gly Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro 
1410 1415 - .1420 - 

Pro Leu Leu Thr Arg Tyr Leu Arg; He His Pro Gin Ser Trp Val His 
1425 1430 " " ' 1435 1440 

Gin He Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu 

1443 ' 1450 1455 

Tyr 



(2) INFORMATION FOR SEQ ID NO = 48: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 0 amino -acids 

(B) TYPE: amino acid 

(C) STRANDEDNFESS : single 

(D) TOPOLOGY: linear ■ 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4Q: 

Arg Gly Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr 
15 10 15 

Gly Asp Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu 
20 25 30 

Ser Lys Asn Asn Ala He Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg 
35 40 45 

His Pro Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Pro Pro Thr Pro 
50 55 60 

Pro Thr Pro Pro Val Leu Lys Arg His Gin Arg Glu He Thr Arg Thr 
65 70 75 80 

Thr Leu Gin Ser Asp Gin Glu Glu He Asp Tyr Asp Asp Thr He Ser 

B5 90 95 

Val Glu Met Lys 
100 



(2) INFORMATION FOR SEQ ID NO =49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY.- linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:4$ : 
AGAGGCATGA CCGCCTTACT GAAGGTTTCT AGTTGTGACA ' AGAACACTGG TGATTATTAC 
GAGGACAGTT ATGAAGATAT TTCAGCATAC TTGCTGAGTA aaaacaatgc cattgaacca 
AGAAGCTTCT CCCAGAATTC TAGACACC CT AGCACTAGGC AAAAGCAATT TAATGCCACC 
CCTCCTACAC CACCAACCCC ACCAGTACTG AAACGCCATC AACGGGAAAT AACTCGTACT 
ACTCTTCAGT CTGATCAAGA GGAAATTGAC TATGATGATA CCATATCAGT TGAAATGAAG 

(2) INFORMATION FOR SEQ ID NO: SO: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single " 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Cxi) SEQUENCE DESCRIPTION : SEQ ID NO;S0; 

Ser Arg His Pro Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Pro Pro 
1 - 5 10 ,15 

Val Leu Lys Arg His Gin Arg Glu lie Thr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: Si: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

■{xi) SEQUENCE. DESCRIPTION: SEQ ID NO = 51. ; 
TCGCGACACC CTAGCACTAG GCAAAAGCAA TTTAATGCCA CCCCACCAGT CCTGAAACGC 
CATCAACGGG AAATAACGCG T 

(2) INFORMATION FOR SEQ ID NO: 52; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTACTCTTC AATCTGATCA AGAGGAA 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 43 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DMA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
CGCGCCGCTC GAGTCTACAA TGGCTTTGCC TTTTO CTTTA CTG 

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE; nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY i -linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION:' SEQ ID NO: 54: 
GCGCCCATCG ATTTATTCCT TCCTCCTTAA CCTTTCTTGC AAG 

(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CGCGCCGCTC GAGCATCCCA ATGGCCCTGT CCTTTTCTTT ACTGATGG 

(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:56: 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 

(2) INFORMATION FOR SEQ ID NO; 57: 
(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH i 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY = linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CCGGATCCTC TACAATGGCT TTGCCTTTTG CTTTACTG 

(2) INFORMATION FOR SEQ ID NO:58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 
(3Ci) SEQUENCE DESCRIPTION: $EQ ID NO:58: 
CGCGCCGGCG GCCGCTTATT CCTTCCTCCT TAACCTTTCT TGCAAG 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 43 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID'NO:5S: 
CCGGATCCCA TCCPAATGGC C CTGTCCTTT TCTTTACTG& TGG 

(2) INFORMATION FOR SEQ ID NO -60; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CGCGCCGGCG GCCGCTCAAT CCTTCCTCCT TAATCTTTTT TGCAAG 

(2) INFORMATION FOR SEQ ID NO: SI: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO16I; 
AGCTTGCTGT TTGTGTGCTG CCTCTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 
TCCCTAAAAT GGGCAAACAT TGCAAGCAGC 

(2) INFORMATION FOR SEQ ID NO:62: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 70 base pairs 

(B) type z nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

AAACAGCAAA CACACAGCCC TCCCTGCCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 
ACCTCTCTGA 

(2) INFORMATION FOR SEQ ID NO:63r 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 base pairs 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 



AGCTTCAGAG AGGTCTCTGA CCTCTGCCCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 
TGTGTTTGCT GTTTGCTGCT TG 



(2) INFORMATION FOR SEQ ID NO : 64 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CAATGTTTGC CCATTTTAGG GACATGAGTA GGCTGAAGTT TGTTCAGTGT GGACTTCAGA 
GGCAGCACAC AAACAGCA r 

(2) INFORMATION FOR SEQ ID NO = 65: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 65: 
AATTCGCTGT TTGTGTGCTG CCTCTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 
TCCCTAAAAT GGGCAAACAT TGCAAGCAGC 

(2) INFORMATION FOR SEQ ID NO: €6 1 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:66 ; 
AAACAGCAAA CACACAGCCC TCCCTGCCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 
AC CTCTCTGG 

(2) INFORMATION FOR SEQ ID NO:67: 
<i) SEQUENCE- CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOs67: 
AATTCCAGAG AGGTCTCTGA CCTCTGCCCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 
TGTGTTTGCT GTTTGCTGCT TG 



(2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CAATCTTTGC CCATTTTAGG GACATGAGTA GGCTGAAGTT TGTTCAGTGT GGACTTCAGA 
GGCAGCACAC AAACAGCG 

(2) INFORMATION FOR SEQ ID NO: 69; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID- NO: 6$: < * 

CGCGCCGCCC GGGGTAGATC TTGCTACCAG TGG 

(2) INFORMATION FOR SEQ" ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 70: 
GCGCCCGCGG CCGCCACTGT CCCAGGTCAG TGGTGGTGCC 

(2) INFORMATION FOR SEQ ID NO; 71: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:7l: 
CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 

(2) INFORMATION FOR SEQ ID NO : 72 z 
(i) SEQUENCE CHARACTERISTICS: 

(A) length i 43 base pairs , 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 

(2) INFORMATION FOR SEQ ID NO: 73: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii] MOLECULE TYPE i DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73= 
CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 



(2) INFORMATION FOR SEQ ID NO; 74; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID*NO:74: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: >. 

(A) LENGTH: 6 amino acids 

(B) TYPE : amino acid - 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
Asn Ser Arg His Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid* 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76; 

Gin Asn Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thz: 
15 10 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: single 1 ■ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
AATTCGCGAC ACCCTAGC 

(2) INFORMATION FOR SEQ ID NO: 78; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78= 
CAAAACCCAC CAGTCTTGAA AC GC CAT CAA CGGGAAATAA CG 



(2) INFORMATION FOR SEQ ID NO: 79: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32 base pairs 

(B) TYPE: nucleic acid 

(G) STRANDEDNESS : Single 
(D) TOPOLOGY ; linear 
(ii) MOLECULE TYPE; DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION- SEQ ID NO: 79 
GCGCTGTGGG ATCGGTTTTG GGTGGTCAGA AC 

► 

(2) INFORMATION FOR SEQ ID NO: 80: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) typEl nucleic acid , , -- 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear - 

(ii) MOLECULE TYPE: • DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: SO 
TTTGCGGTAG TTGCCCTTTA TTGC 



(2) INFORMATION FOR SEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : Single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 81: 
Arg Thr Leu Gin Ser Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO 2 82 1 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY? linear 

(ii) MOLECULE TYPE: DNA (genomic) 
• (xi) SEQUENCE DESCRIPTION; SEQ IDNQ:S2:' 
CGTACTCTTC AGTCT 

(2) INFORMATION FOR SEQ ID NO: 83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS z single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GCATGAGAAG TCAGACTAG 

(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS z 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 



(C) STRAND EDNESS = single 

(D) topology s linear 

(xi) SEQUENCE DESCRIPTION" : S£Q ID NO: 34 
TGCATGCCTG CAGGTC 



